Challenges You Will Face When Parsing PDFs with Python
theseattledataguy.com·12h·
Discuss: Hacker News
📄PDF Archaeology
A Simple Guide to Keyword Clustering with spaCy
dev.to·6h·
Discuss: DEV
🏰Medieval Parsing
Resurrected - Two Latin Texts
naomiceder.tech·1h
🔤Font Archaeology
Nipdf: PDF Reader in Rust
github.com·9h·
Discuss: Hacker News
🦀Rust Macros
Bookends 15.2
tidbits.com·11h
📄PostScript
OpenAI releases GPT-5 Codex designed for bug fixes and code generation
the-decoder.com·7h
🔓Open Source Software
Linkage
11011110.github.io·10h
Linear Algebra
Work in Progress
i.redd.it·2d·
Discuss: r/homelab
🦋Format Metamorphosis
In-depth Review of Emacs tree-sitter integration
archive.casouri.cc·4h·
Discuss: Lobsters
🌳Incremental Parsing
Semantic Dictionary Encoding
falvotech.com·12h·
Discuss: Hacker News
🌀Brotli Dictionary
OTW - Bandit Level 4 to Level 5
tbhaxor.com·21h
🔧KAITAI
Lessons from using AI in Discovery
thoughtbot.com·1d
🕵️Metadata Mining
Show HN: Semlib – Semantic Data Processing
github.com·13h·
Discuss: Hacker News
🌳Incremental Parsing
Defect Chemistry: Automated Compositional Analysis of Amorphous Solid Electrolytes via Dynamic Spectroscopy
dev.to·6h·
Discuss: DEV
🌈Spectroscopy
How to Remove Invisible Characters From AI Text (Free Tool)
hackernoon.com·1d
OCR Correction
Docling: The Document Alchemist
towardsdatascience.com·3d
📋Document Grammars
WorldCat Editions and Holdings Release
annas-archive.org·1d·
Discuss: Hacker News
📚MARC Records
Slidebee – turn any ArXiv paper into a presentation
slidebee.genmini.ai·1d·
Discuss: Hacker News
Concrete Syntax
Top 11 Document Parsing AI Tools for developers in 2025
dev.to·2d·
Discuss: DEV
📄Document Digitization